Digit Recognition Using the SPEECHDAT Corpus
نویسندگان
چکیده
With the remarkable evolution of telecommunications as we reach the end of this century, it becomes clear that speech recognition via the telephone network will play an increasingly important role, mainly due to the widespread use of both cellular and non-cellular telephones. For many applications of speech recognition over the telephone, digit recognition is fundamental. This paper describes a set of digit recognition experiments with the SPEECHDAT corpus for European Portuguese. We present techniques and results obtained with isolated and connected digits with both known and unknown length grammars. Error rates of 0.6% and 1,9% were achieved, respectively, for isolated digit and connected digit strings.
منابع مشابه
Quantile based histogram equalization for online applications
The noise robustness of automatic speech recognition systems can be increased by transforming the signal to make the cumulative density functions of the signal’s values in recognition match the ones that where estimated on the training data. This paper describes a real–time online algorithm to approximate the cumulative density functions, after Mel scaled filtering, using a small number of quan...
متن کاملDevelopment of the estonian speechdat-like database
A new database project has been launched in Estonia last year. It aims the collection of telephone speech from a large number of speakers for speech and speaker recognition purposes. Up to 2000 speakers are expected to participate in recordings. SpeechDat databases, especially Finnish SpeechDat, have been chosen as a prototype for the Estonian database. It means that principles of corpus design...
متن کاملDevelopment of a Real-time Asr System for Slovak Speechdat Database
This paper describes development of a real-time speech recognition system in Slovak for the voice-operated telephone services. The system is based on SPHINX2 platform. The decoder using Hidden Markov Models was trained on the SpeechDat-E Slovak database. It is speaker independent, large vocabulary, continuous speech real-time automatic speech recognition system. Test results are given for the t...
متن کاملMonolingual and Bilingual Spanish-Catalan Speech Recognizers Developed from SpeechDat Databases
Under the SpeechDat specifications, the Spanish member of SpeechDat consortium has recorded a Catalan database that includes one thousand speakers. This communication describes some experimental work that has been carried out using both the Spanish and the Catalan speech material. A speech recognition system has been trained for the Spanish language using a selection of the phonetically balance...
متن کاملSome Experiments on the Use of One-channel Noise Reduction Techniques with the Italian Speechdat Car Database
In this work the use of noise reduction techniques for handsfree speech recognition in car environment is investigated. A set of experiments was carried out using different speech enhancement algorithms based on noise estimation. In particular, linear subtraction and MMSE estimators are considered in their various configurations, which depend on a different set of parameters. Experiments were c...
متن کامل